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(57) ABSTRACT 

A flexible, policy-based, mechanism for managing, 
monitoring, and prioritizing traffic within a network and 
allocating bandwidth to achieve true quality of service 
(QoS) is provided. According to one aspect of the present 
invention, a method is provided for managing bandwidth 
allocation in a network that employs a non-deterministic 
access protocol, such as an Ethernet network. A packet 
forwarding device receives information indicative of a set of 
traffic groups, such as: a MAC address, or IEEE 802. lp 
priority indicator or 802. 1Q frame tag, if the QoS policy is 
based upon individual station applications; or a physical port 
if the QoS policy is based purely upon topology. The packet 
forwarding device additionally receives bandwidth param- 
eters corresponding to the traffic groups. After receiving a 
packet associated with one of the traffic groups on a first 
port, the packet forwarding device schedules the packet for 
transmission from a second port based upon bandwidth 
parameters corresponding to the traffic group with which the 
packet is associated. According to another aspect of the 
present invention, a method is provided for managing band- 
width allocation in a packet forwarding device. The packet 
forwarding device receives information indicative of a set of 
traffic groups. The packet forwarding device additionally 
receives information defining a QoS policy for the traffic 
groups. After a packet is received by the packet forwarding 
device, a traffic group with which the packet is associated is 
identified. Subsequently, rather than relying on an end-to- 
end signaling protocol for scheduling, the packet is sched- 
uled for transmission based upon the QoS policy for the 
identified traffic group. 

28 Claims, 5 Drawing Sheets 
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POLICY BASED QUALITY OF SERVICE IEEE 802.1 Q draft standard provides a packet format for an 

application to specify which Virtual Local Area Network 

This is a continuation of application Ser. No. 09/018,103, (VLAN) a packet belongs to and the priority of the packet, 

filed on Feb. 3, 1998, now issued U.S. Pat. No. 6,104,700. ^ IEEE % 02 - l V committee provides a guideline to classify 

This application claims the benefit of U.S. Provisional 5 traffic based on a priority indicator in an 802.1Q frame tag. 

Application No. 60/057,371, filed Aug. 29, 1997. ™* allows VLANs t0 be 8 rcm P ed into ei g ht different traffic 

classes or priorities. The IEEE 802. lp committee does not, 

COPYRIGHT NOTICE however, define the mechanism to service these traffic 

classes. 

Contained herein is material that is subject to copyright What is needed is a way to provide true Quality of Service 

protection. The copyright owner has no objection to the ("QoS") in a network employing a non-deterministic access 

facsimile reproduction of the patent disclosure by any per- protocol, such as an Ethernet network, that not only has the 

son as it appears in the Patent and Trademark Office patent ability to prioritize and service different traffic classes, but 

files or records, but otherwise reserves all rights to the additionally provides bandwidth management and guaran- 

copyright whatsoever. lees a quantifiable measure of service for packets associated 

15 with a particular traffic class. More specifically, with respect 

BACKGROUND OF THE INVENTION to bandwidth management, it is desirable to employ a 

weighted fair queuing delivery schedule which shares avail- 

1. Field of the Invention aD i e bandwidth so that high priority traffic is usually sent 
The invention relates generally to the field of computer first, but low priority traffic is still guaranteed an acceptable 

networking devices. More particularly, the invention relates 20 minimum bandwidth allocation. Also, it is desirable to 

to a flexible, policy-based mechanism for managing, centralize the control over bandwidth allocation and traffic 

monitoring, and prioritizing traffic within a network and priority to allow for QoS without having to upgrade or alter 

allocating bandwidth to achieve true Quality of Service end-stations and existing routers as is typically required by 

(QoS). end-to-end protocol solutions. Further, it would be advan- 

2. Description of the Related Art 25 kg«ous to put the control in the hands of network managers 
KW i*a=*j • j . jlj by performing bandwidth allocation and traffic prioritization 
Network traffic today is more diverse and bandwidth- « . * r j * j j • ■ * *■ i- ■ 

. r r-, j , . t t * ji * based upon a set ol manager-defined administrative policies, 

intensive than ever before. Today s intranets are expected to i- n ■ *l ^ ^ c * i * i 

t . l4 . j. x. « • f Finally, since there are many levels of control a network 

support interactive multimedia, full-motion video, nch J A A 4 , . . *\ . , . L1 4 

rr V, . jj--.1l. t_r- t_< manager may elect to administer, it is desirable to provide a 

graphic unages and digital photography. Expectations about . ' & f v , t . , . . , r 4 - 

fu f* j *■ i . * • c *j • a 30 variety of scheduling mechanisms based upon a core set of 

the quality and timely presentation of information received A „ ; 41 - * * 

c *_ i . i_- l ,l t j QoS profile attributes, 

from networks is higher than ever. Increased network speed r 

and bandwidth alone will not satisfy the high demands of BRIEF SUMMARY OF THE INVENTION 

today's intranets. A flexible, policy-based, mechanism for managing, 

The Internet Engineering Task Force (IETF) is working 35 monitoring, and prioritizing traffic within a network and 

on a draft standard for the Resource Reservation Protocol allocating bandwidth to achieve true Quality of Service 

(RSVP), an Internet Protocol-(IP) based protocol that allows (QoS) is described. According to one aspect of the present 

end-stations, such as desktop computers, to request and invention, a method is provided for managing bandwidth 

reserve resources within and across networks. Essentially, allocation in a network that employs a non-determninistic 

RSVP is an end-to-end protocol that defines a means of ^ access protocol. A packet forwarding device receives infor- 

communicating the desired Quality of Service between mation indicative of a set of traffic groups. The packet 

routers. RSVP is receiver initiated. The end-station that is forwarding device additionally receives parameters, such as 

receiving the data stream communicates its requirements to bandwidth and priority parameters, corresponding to the 

an adjacent router and those requirements are passed back to traffic groups. After receiving a packet associated with one 

all intervening routers between the receiving end -station and 45 of the traffic groups on a first port, the packet forwarding 

the source of the data stream and finally to the source of the device schedules the packet for transmission from a second 

data stream itself. Therefore, it should be apparent that port based upon parameters corresponding to the traffic 

RSVP must be implemented across the whole network. That group with which the packet is associated. Advantageously, 

is, both end-stations (e.g., the source and destination of the in this manner, a weighted fair queuing schedule that shares 

data stream) and every router in between should be RSVP 50 bandwidth according to some set of rules may be achieved, 

compliant in order to accommodate the receiving end- According to another aspect of the present invention, a 

station's request. method is provided for managing bandwidth allocation and 

While RSVP allows applications to obtain some degree of traffic prioritization in a packet forwarding device. The 

guaranteed performance, it is a first-come, first-served packet forwarding device receives information indicative of 

protocol, which means if there are no other controls within 55 a set of traic groups. The packet forwarding device addi- 

the network, an application using RSVP may reserve and tionally receives information defining a Quality of Service 

consume resources that could be needed or more effectively (QoS) policy for the traffic groups. After a packet is received 

utilized by some other mission-critical application. A further by the packet forwarding device, a traffic group with which 

limitation of this approach to resource allocation is the fact the packet is associated is identified. Subsequently, rather 

that end-stations and routers must be altered to be RSVP 6 o than relying on an end-to-end signaling protocol for 

compliant. Finally, RSVP lacks adequate policy mechanisms scheduling, the packet is scheduled for transmission based 

for allowing differentiation between various traffic flows. It upon the QoS policy for the identified traffic group, 

should be appreciated that without a policy system in place, Therefore, bandwidth allocation and traffic prioritization are 

the network manager loses control. based upon a set of administrative policies over which the 

Recent attempts to facilitate traffic differentiation and 65 network manager retains control, 

prioritization include draft standards specified by the Insti- According to yet another aspect of the present invention, 

tute of Electrical and Electronics Engineers (IEEE). The a number of QoS queues are provided at each port of the 



10/06/2004, EAST Version: 1.4.1 



US 6,6' 

3 

packet forwarding device. A current bandwidth metric is 
determined for each of the QoS queues for a particular port. 
The QoS queues are divided into two groups based upon 
their respective bandwidth metrics and their respective mini- 
mum bandwidth requirements. Subsequently, the groups are 
used as a first level arbitration mechanism to select a QoS 
queue that will source the next packet. 

Other features of the present invention will be apparent 
from the accompanying drawings and from the detailed 
description which follows. 

BRIEF DESCRIPTION OF THE SEVERAL 
VIEWS OF THE DRAWINGS 

The present invention is illustrated by way of example, 
and not by way of limitation, in the figures of the accom- 
panying drawings and in which like reference numerals refer 
to similar elements and in which: 

FIG. 1A is a simplified block diagram of an exemplary 
switch architecture in which one embodiment of the present 
invention may be implemented. 

FIG. IB is a logical view of the interaction between 
switch processing blocks according to one embodiment of 
the present invention. 

FIG. 2 is a flow diagram illustrating high level bandwidth 
management and traffic prioritization processing according 
to one embodiment of the present invention. 

FIG. 3 is a flow diagram illustrating periodic evaluation of 
QoS categories according to one embodiment of the present 
invention. 

FIG. 4 is a flow diagram illustrating next packet sched- 
uling according to one embodiment of the present invention, 

DE TAILED DESCRIPTION OF THE 
. INVENTION 

A flexible, policy-based, mechanism for managing, 
monitoring, and prioritizing trafl&c within a network and 
allocating bandwidth to achieve true Quality of Service 
(QoS) is described. "Quality of Service" in this context 
essentially means that there is a quantifiable measure of the 
service being provided. The measure of service being pro- 
vided may be in terms of a packet loss rate, a maximum 
delay, a committed minimum bandwidth, or a limited maxi- 
mum bandwidth, for example. 

In the present invention, a number of QoS queues may be 
provided at each port of a packet forwarding device, such as 
a Local Area Network (LAN) switch. Based upon a set of 
QoS parameters, various types of traffic can be distinguished 
and associated with particular QoS queues. For example, 
packets associated with a first traffic group may be placed 
onto a first QoS queue and packets associated with another 
traffic group may be placed onto a second QoS queue. When 
a port is ready to transmit the next packet, a scheduling 
mechanism may be employed to select which QoS queue of 
the QoS queues associated with the port will provide the 
next packet for transmission. 

In the following description, for the purposes of 
explanation, numerous specific details are set forth in order 
to provide a thorough understanding of the present inven- 
tion. It will be apparent, however, to one skilled in the art 
that the present invention may be practiced without some of 
these specific details. In other instances, well-known struc- 
tures and devices are shown in block diagram form. 

The present invention includes various steps, which will 
be described below. The steps of the present invention may 
be performed by hardware components or may be embodied 
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in machine-executable instructions, which may be used to 
cause a general-purpose or special-purpose processor pro- 
grammed with the instructions to perform the steps. 
Alternatively, the steps may be performed by a combination 
5 of hardware and software. While, embodiments of the 
present invention will be described with reference to a high 
speed Ethernet switch, the method and apparatus described 
herein are equally applicable to other types of network 
devices or packet forwarding devices, 

10 

An Exemplary Switch Architecture 

An overview of the architecture of a switch 100 in which 
one embodiment of the present invention may be imple- 
mented is illustrated by FIG. 1A The central memory 

15 architecture depicted includes multiple ports 105 and 110 
each coupled via a channel to a filtering/forwarding engine 
115. Also coupled to the filtering/forwarding engine 115 is 
a forwarding database 120, a packet Random Access 
Memory (RAM) 125, and a Central Processing Unit (CPU) 

20 130. 

According to one embodiment, each channel is capable of 
supporting a data transfer rate of one gigabit per second in 
the transmit direction and one gigabit per second in the 
receive direction, thereby providing 2 Gb/s full-duplex capa- 
bility per channel. Additionally, the channels may be con- 
figured to support one Gigabit Ethernet network connection 
or eight Fast Ethernet network connections. 

The filtering/forwarding engine 115 includes an address 

30 filter (not shown), a switch matrix (not shown), and a buffer 
manager (not shown). The address filter may provide 
bridging, routing, Virtual Local Area Network (VLAN) 
tagging functions, and traffic classification. The switch 
matrix connects each channel to a central memory such as 

35 packet RAM 125. The buffer manager controls data buffers 
and packet queue structures and controls and coordinates 
accesses to and from the packet RAM 125. 

The forwarding database 120 may store information use- 
ful for making forwarding decisions, such as layer 2 (e.g., 

4Q Media Access Control (MAC) layer), layer 3 (e.g., Network 
layer), and/or layer 4 (e.g., Transport layer) forwarding 
information, among other things. The switch 100 forwards a 
packet received at an input port to an output port by 
performing a search on the forwarding database using 

45 address information contained within the header of the 
received packet. If a matching entry is found, a forwarding 
decision is constructed that indicates to which output port 
the received packet should be forwarded, if any. Otherwise, 
the packet is forwarded to the CPU 130 for assistance in 

5 Q constructing a forwarding decision. 

The packet RAM 125 provides buffering for packets and 
acts as an elasticity buffer for adapting between incoming 
and outgoing bandwidth differences. Packet buffering is 
discussed further below. 

Logical View of Exemplary Switch Processing 

FIG. IB is a logical view of the interaction between 
exemplary switch processing blocks that may be distributed 
throughout the switch 100. For example, some of the pro- 

60 cessing may be performed by functional units within the 
ports of the switch and other processing may be performed 
by the CPU 130 or by the address filter/switch matrix/buffer 
manager 115. In any event, the processing can be concep- 
tually divided into a first group of functions 160 dedicated 

65 to input processing and a second group of functions 185 
dedicated to output processing. According to the present 
embodiment, the first group 160 includes a comparison 
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engine 155, an enqueue block 161, a packet classification 
block 150, and a buffer manager 165. The second group 185 
includes a dequeue block 162, a Quality of Service (QoS) 
category evaluation block 175, and a scheduler 170. 

Additionally, a user interface (UI) 145 may be provided 5 
for receiving various parameters from the network manager. 
The UI may be text based or graphical. In one embodiment, 
the UI 145 may include an in-band HyperText Markup 
Language (HTML) browser-based management tool which 
may be accessed by any standard web browser. In any event, 10 
the goal of the UI 145 is to separate high-level policy 
components, such as traffic grouping and QoS profiles from 
the details of the internal switch hardware. Thus, user 
configuration time is minimized and a consistent interface is 
provided to the user. 15 

The UI 145 receives information indicative of one or 
more traffic groups. This information may be provided by 
the network manager. There are several ways to define a 
traffic group. Table 1 below illustrates a variety of traffic 
classification schemes that may be supported by the UI 145. 20 

TABLE 1 



Policy Based Upon 



Traffic Classifies tioa 



Traffic Group Definition OSI Layer 



Applications 



Transport Layer 



25 



30 



TCP Session 
UDP Session 
RSVP Flow 

Network Layer Network Layer Protocol Network Layer 

Topology or Groups of Subnet or LP Address 

Users VLAN Identifier 

End-Station Applications MAC Address Link Layer 

802. lp or 802. 1Q 
Physical Topology Physical Port Physical Layer 



The information used to identify a traffic group typically 
depends upon what terms the QoS policy is defined. If the 
QoS policy is based on applications, traffic groups may be 
differentiated at the Transport layer by Transmission Control 
Protocol (TCP) session or User Datagram Protocol (UDP) 40 
session. For example, the network manager may provide 
information indicative of TCP source and destination ports 
and IP source and destination addresses to identify traffic 
groups. However, if the QoS policy is based upon the 
Network layer topology or groups of users, traffic group 45 
definition may be more convenient by supplying informa- 
tion regarding the Network layer protocol, such as Internet 
Protocol (IP) or Internetwork Packet Exchange (IPX), the 
subnet or IP addresses, or VLAN identifiers. If the QoS 
policy is defined by end-station applications, then Media 50 
Access Control (MAC) addresses, IEEE 802. lp priority 
indications, or IEEE 802. 1Q frames may be employed to 
identify traffic groups. Finally, if the QoS policy is physical 
topology based, physical port identifiers may be used to 
differentiate traffic groups. 55 

It should be noted that Table 1 merely presents an 
exemplary set of traffic group identification mechanisms. 
From the examples presented herein, additional, alternative, 
and equivalent traffic grouping schemes and policy consid- 
erations will be apparent to those of ordinary skill in the art. eo 
For example, other state information may be useful for 
purposes of packet classification, such as the history of 
previous packets, the previous traffic load, the time of day, 
etc. 

It is appreciated that traffic classifications based upon the 65 
traffic group definitions listed above may result in overlap. 
Should the network manager define overlapping traffic 



groups, the UI 145 may issue an error message and reject the 
most recent traffic group definition, the UI 145 may issue a 
warning message to the network manager and allow the 
more specific traffic group definition to override a conflict- 
ing general traffic group definition, or the UI 145 may be 
configured to respond in another manner. 

A number of QoS queues 180 may be provided at each of 
the ports of a packet forwarding device. In one embodiment, 
a mapping of traffic groups to QoS queues 180 may be 
maintained. As traffic groups are provided by the network 
manager, the UI 145 updates the local mapping of traffic 
groups to QoS queues 180. This mapping process may be a 
one-to-one mapping of the traffic groups defined by the 
network manager to the QoS queues 180 or the mapping 
process may be more involved. For example, there may be 
more traffic groups than QoS queues 180, in which case, 
more than one traffic group will be mapped to a single QoS 
queue. Some consolidation rules for combining multiple 
traffic groups into a single QoS queue will be discussed 
below. 

At any rate, by providing a layer of abstraction in this 
manner, the network manager need not be burdened with the 
underlying implementation details, such as the number of 
QoS queues per port and other queuing parameters. Another 
advantage achieved by this layer of abstraction between the 
traffic group definitions and the physical QoS queues is the 
fact that the UI 145 is now decoupled from the underlying 
implementation. Therefore, the UI 145 need not be updated 
if the hardware QoS implementation changes. For example, 
software providing for traffic group definition need not be 
changed simply because the number of QoS queues per port 
provided by the hardware changes. 

The input data stream is received by the comparison 
engine 155 from input switch ports (not shown). Under the 
direction of the packet classification process 150, the com- 
parison engine 155 determines with which of the previously 
defined traffic groups a packet in the data stream is associ- 
ated. The packet classification block 150 may employ the 
traffic group indications provided by the network manager to 
provide the comparison engine 155 with information regard- 
ing locations and fields to be compared or ignored within the 
header of a received packet, for example. It should be 
appreciated if the comparison required for traffic classifica- 
tion is straightforward, such as in a conventional packet 
forwarding device, then the comparison engine 155 and the 
packet classification block 150 may be combined. 

The packet classification block 150 in conjunction with 
the UI 145 provide a network manager with a flexible 
mechanism to control traffic prioritization and bandwidth 
allocation through the switch 100. Importantly, no end-to- 
end signaling protocol needs to be implemented by the 
network devices. For example, the end-station that is to 
receive the data stream need not reserve bandwidth on each 
of the intermediate devices between it and the source of the 
data stream. Rather, a packet forwarding device employing 
the present invention can provide some benefit to the net- 
work without requiring routers and/or end-stations to do 
anything in particular to identify traffic. Thus, traffic priority 
may be enforced by the switch 100 and QoS may be 
delivered to applications without altering routers or end- 
stations. 

According to one embodiment, the buffer manager 165 
participates in policy based QoS by controlling the alloca- 
tion of buffers within the packet RAM 125. Buffers may be 
dynamically allocated to QoS queues 180 as needed, within 
constraints established by QoS profile attributes, which are 
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discussed below. The buffer manager 165 may maintain 
several programmable variables for each QoS queue. For 
example, a Minimum Buffer Allocation and a Maximum 
Queue Depth may be provided for each QoS queue. The 
Minimum Buffer Allocation essentially reserves some mini- 5 
mum number of buffers in the packet RAM 125 for the QoS 
queue with which it is associated. The Maximum Queue 
Depth establishes the maximum number of buffers that can 
be placed on a given QoS queue. The buffer manager 165 
also maintains a Current Queue Depth for each QoS queue 
to assure the maximum depth is not exceeded. For example, 
before allowing a buffer to be added to a given QoS queue, 
the buffer manager 165 may compare the Maximum Queue 
Depth to the Current Queue Depth to ensure the Maximum 
Queue Depth is not exceeded. 

Variables are also maintained for tracking free buffers in 15 
the packet RAM 125. At initialization, a Buffers Free Count 
contains the total number of buffers available in the packet 
RAM 125 and a Buffers Reserved Count contains the sum of 
the minimum buffer allocations for the QoS queues 180. As 
packets are received they are stored in free buffers, and the 20 
Buffers Free Count is decremented by the number of buffers 
used for such storage. After the appropriate QoS queue has 
been identified the buffer manager 165 instructs the enqueue 
block 161 to add the packet to the QoS queue. The enqueue 
block 161 links the packet to the identified queue provided 25 
that tbe Current Queue Depth is less than the Maximum 
Queue Depth and either (1) the Current Queue Depth is less 
than the Minimum Buffer Allocation or (2) the Buffers 
Reserved Count is less than the Buffers Free Count. 
Therefore, if a QoS queue exceeds its reserve of buffers 3Q 
(e.g., Minimum Buffer Allocation), to the extent that addi- 
tional buffers remain free, the QoS queue may continue to 
grow. Otherwise, the enqueue block 161 will discard the 
packet, the buffers are returned to the free pool, and the 
Buffers Free Count is increased by the number of buffers that 
would have been consumed by the packet. When a packet is 35 
successfully linked to a QoS queue, the Current Queue 
Depth for that QoS queue is increased by the number of 
buffers used by the packet. If, prior to the addition of the 
packet to the queue, the Current Queue Depth was less than 
the Minimum Buffer Allocation then the Buffers Reserved 40 
Count is decreased by the lesser of (1) the number of buffers 
in the packet or (2) the difference between the Current 
Queue Depth and the Minimum Buffer Allocation. 

The QoS category evaluation process 175 separates the 
QoS queues into a plurality of categories based upon a set of 45 
bandwidth parameters. The scheduler 170 uses the grouping 
provided by the QoS category evaluation process 175 to 
select an appropriate QoS queue for sourcing the next packet 
for a particular port. The evaluation of QoS queue categories 
may be performed periodically or upon command by the 50 
scheduler 170, for example. Periodic evaluation of QoS 
categories and scheduling is discussed in further detail 
below. 

Responsive to the scheduler 170 the dequeue block 162 
retrieves a packet from a specified QoS queue. After the 55 
packet has been transmitted, the buffer variables are 
updated. The Buffers Free Count is increased and the 
Current Queue Depth is decreased by the number of buffers 
utilized to store the packet. If the resulting Current Queue 
Depth is less than the Minimum Buffer Allocation, then the 
Buffers Reserved Count is increased by the lesser of the 60 
number of buffers utilized to store the packet or the differ- 
ence between the Current Queue Depth and the Minimum 
Buffer Allocation. 

QoS Profile Attributes 65 

Setting QoS policy is a combination of identifying traffic 
groups and defining QoS profiles for those traffic groups. 
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According to one embodiment, each individual traffic group 
may be associated with a QoS profile. However, in alterna- 
tive embodiments, multiple traffic groups may share a com- 
mon QoS profile. Having described traffic group classifica- 
tion and identification above, QoS profile attributes (also 
referred to as parameters) will now be discussed. 

Several queuing mechanisms may be implemented using 
one or more of the following parameters associated with a 
traffic group: (1) minimum bandwidth, (2) maximum 
bandwidth, (3) peak bandwidth, (4) maximum delay, and (5) 
relative priority. In general, the minimum, maximum, and 
peak bandwidth parameter may be expressed in Mbps, a 
percentage of total bandwidth, or any other convenient 
representation. 

Minimum bandwidth indicates the minimum amount of 
bandwidth a particular traffic group needs to be provided 
over a defined time period. If the sum of the minimum 
bandwidths for all traffic groups defined is less than 100% of 
the available bandwidth, then the scheduling processing, 
discussed below, can assure that each traffic group will 
receive at least the minimum bandwidth requested. 

Maximum bandwidth is the maximum sustained band- 
width the traffic group can realize over a defined time period. 
In contrast, peak bandwidth represents the bandwidth a 
traffic group may utilize during a particular time interval in 
excess of the maximum bandwidth. The peak bandwidth 
parameter may be used to limit traffic bursts for the traffic 
group with which it is associated. The peak bandwidth also 
determines how quickly the traffic group's current band- 
width will converge to the maximum bandwidth. By pro- 
viding a peak bandwidth value that is much higher than the 
maximum bandwidth, if sufficient bandwidth is available, 
the maximum bandwidth will be achieved relatively quickly. 
In contrast, a peak bandwidth that is only slightly higher that 
the maximum bandwidth will cause the convergence to the 
maximum bandwidth to be more gradual. 

Maximum delay specifies a time period beyond which 
further delay cannot be tolerated for the particular traffic 
group. Packets comprising the traffic group that are for- 
warded by the switch 100 are guaranteed not to be delayed 
by more than the maximum delay specified. 

Relative priority defines the relative importance of a 
particular traffic group with respect to other traffic groups. 
As will be discussed further below, within the same QoS 
category, traffic groups with a higher priority are preferred 
over those with lower priorities. 

This small set of parameters in combination with the 
variety of traffic classification schemes gives a network 
manager enormous control and flexibility in prioritizing and 
managing traffic flowing through packet forwarding devices 
in a network. For example, the QoS profile of a video traffic 
group, identified by UDP session, might be defined to have 
a high priority and a minimum bandwidth of 5 Mbps, while 
the QoS profile of an engineering traffic group, identified by 
VLAN, may be set to a second priority, a minimum band- 
width of 30 Mbps, a maximum bandwidth of 50 Mbps, and 
a peak bandwidth of 60 Mbps. Concurrently, the QoS profile 
of a World Wide Web (WWW) traffic group, identified by 
protocol (e.g., IP), may be set to have a low ' priority, a 
minimum bandwidth of 0 Mbps, a maximum bandwidth of 
100%, and a peak bandwidth of 100%. 

Consolidation Rules 

It was mentioned earlier that multiple traffic groups may 
be mapped to a single QoS queue. This may be accom- 
plished by maintaining an independent set of variables (e.g., 
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minimum bandwidth, maximum bandwidth, peak 
bandwidth, maximum delay, and relative priority) for each 
QoS queue in addition to those already associated with each 
traffic group and following the general consolidation rules 
outlined below. 

Briefly, when the mapping from traffic groups to QoS 
queues is one-to-one, the determination of a particular QoS 
queues' attributes is straightforward. The QoS queue's 
attributes simply equal the traffic group's attributes. 
However, when combining multiple traffic groups that do 
not share a common QoS profile onto a single QoS queue, 
the following general consolidation rules are suggested: (1) 
add minimum attributes of the traffic groups being combined 
to arrive at an appropriate minimum attribute for the target 
QoS queue (e.g., the QoS queue in which the traffic will be 
merged), (2) use the largest of maximum attributes to arrive 
at an appropriate value for a maximum attribute for the 
target QoS queue, and (3) avoid merging traffic groups that 
have different relative priorities. This last rule suggests the 
number of priority levels provided should be less than or 
equal to the number of QoS queues supported by the 
implementation to assure traffic groups with different pri- 
orities are not combined in the same QoS queue. 

Importantly, when a network manager has determined that 
multiple traffic groups will share a common QoS profile, the 
consolidation rules need not apply, as the network manager 
has already, in effect, manually consolidated the parameters. 

Bandwidth Management and Traffic Prioritization 

Having described an exemplary environment in which 
one embodiment of the present invention .may be 
implemented, bandwidth management and traffic prioritiza- 
tion will now be described with reference to FIG. 2. FIG. 2 
is a flow diagram illustrating the high level bandwidth 
management and traffic prioritization processing according 
to one embodiment of the present invention. In this 
embodiment, at step 210, a manager-defined QoS policy 
may be received via the UI 145, for example. The QoS 
policy is a combination of traffic groups and QoS profile 
attributes corresponding to those traffic groups. 

At step 220, a packet is received by the switch 100. Before 
the packet can be placed onto a QoS queue for transmission, 
the traffic group to which the packet belongs is identified at 
step 230. Typically, information in the packet header, for 
example, can be compared to the traffic group criteria 
established by the network manager to identify the traffic 
group to which the packet belongs. This comparison or 
matching process may be achieved by programming filters 
in the switch 100 that allow classification of traffic. Accord- 
ing to one embodiment, the packet may be identified using 
the traffic group definitions listed in Table 1 . 

At step 250, enqueue processing is performed. The packet 
is added to the rear of the appropriate QoS queue for the 
identified traffic group. Importantly, if a maximum delay has 
been assigned to the traffic group with which the packet is 
associated, then the packet should either be dropped or 
transmitted within the period specified. According to one 
embodiment, this may be accomplished by limiting the 
depth (also referred to as length) of the corresponding QoS 
queue. Given the minimum bandwidth of the QoS queue and 
the maximum delay the traffic group can withstand, a 
maximum depth for the QoS queue can be calculated. If the 
QoS queue length remains less than or equal to the maxi- 
mum length, then the packet is added to the QoS queue. 
However, if the QoS queue length would exceed the maxi- 
mum length by the addition, then the packet is dropped. 
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At step 260, scheduling is performed. The scheduling/ 
dequeuing processing involves determining the appropriate 
QoS queue group, selecting the appropriate QoS queue 
within that QoS queue group, and removing the packet at the 
5 front of the selected QoS queue. This selected packet will be 
the next packet the port transmits. Scheduling will be 
discussed further below. 

Evaluation of QoS Categories 

10 According to one embodiment of the present invention, it 
is advantageous to divide the QoS queues into at least two 
categories. The categories may be defined based upon the 
maximum bandwidth, the minimum bandwidth, the peak 
bandwidth, and the "current bandwidth." The current band- 

35 width should not be mistaken for a bandwidth at an instant 
in time, rather the current bandwidth is a moving average 
that is updated periodically upon the expiration of a prede- 
termined time period. Empirical data suggests this predeter- 
mined time period should be on the order of ten packet 

20 times, wherein a packet time is the time required to transmit 
a packet. However, depending upon the environment and the 
nature of the traffic, a value in the range of one to one 
hundred packet times may be more suitable. 

The members of the first category ("Category A") are 
those QoS queues which have a current bandwidth that is 
below their peak bandwidth and below their minimum 
bandwidth. Members of the second category ("Category B") 
include those QoS queues that have a current bandwidth that 

3 q is greater than or equal to their minimum bandwidth, but less 
than both their maximum bandwidth and their peak band- 
width. The remaining QoS queues (e.g., those having a 
current bandwidth that is greater than or equal to either the 
peak bandwidth or the maximum bandwidth) are ineligible 

35 for transmission. These QoS queues that are ineligible for 
transmission can be considered a third category ("Category 
C"). With this overview of QoS categories, an exemplary 
process for periodic evaluation of QoS categories will now 
be described. 

4Q FIG. 3 is a flow diagram illustrating periodic evaluation of 
QoS categories according to one embodiment of the present 
invention. In this embodiment, at step 310, processing loops 
until the predetermined evaluation time period has expired. 
For example, a test may be performed to determine if the 

4S current time is greater than or equal to the last evaluation 
time plus the predetermined evaluation time interval. 
Alternatively, the evaluation process may be triggered by an 
interrupt. In any event, when it is time to evaluate the QoS 
queue categorization, processing continues with step 330. 

50 It will be appreciated that the time interval chosen for the 
predetermined evaluation time period should not be too long 
or too short. If the time interval is too long, one QoS queue 
might be allowed to. monopolize the link until its maximum 
bandwidth is achieved while other QoS queues remain idle. 

55 If the time interval is too short, transmitting a single packet 
or remaining idle for a single packet time may cause the QoS 
queue to become a member of a different QoS category (e.g., 
the single transmission may cause the current bandwidth to 
exceed the maximum bandwidth or the single idle time may 

60 cause the current bandwidth to fall below the minimum 
bandwidth) because the moving average moves very quickly 
over short time intervals. 

At step 330, the current bandwidth for a particular QoS 
queue is set to the current bandwidth for that QoS queue as 

65 calculated in the previous time interval multiplied by a first 
weighting factor plus the actual bandwidth that particular 
QoS queue received in the previous time interval multiplied 
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by a second weighting factor, wherein the weighting factors bandwidth and minimum bandwidth (e.g., Category A), 

may be selected to achieve the desired level of responsive- However, if no QoS queues meet this criteria, Category B is 

ness in the current bandwidth metric. For example, it may be selected. 

desirable to have the current bandwidth converge to within At step 430, if multiple QoS queues are members of the 

a certain percentage of a sustained bandwidth if that band- 5 selected QoS category, processing continues with step 440, 

width has been sustained for a certain amount of time. otherwise, processing branches to step 470. 

Exemplary weighting factors are in the form (w-l)/w and M ^ ^ dori{ks Qf ^ QqS g ^ 

1/w, respectively. Usmg weighting factors of ^/id for me first used to ^ a ^ QoS ^ of ^ 

weighting factor and a value of Vfc for the second weighting cat ^ have di daU 

tactor, tor example, the current band WLdth will reflect 50% io . .__ . 

of a step within 13 time iotervals, 80% of a step within 27 M . ste P 450 ' 11 two or more QoS 9 ueues same 

time intervals, and will be within 2% of the sustained pnonty then processing continues with step 460. Otherwise, 

bandwidth in approximately 63 time intervals (assuming a ^ a QoS ^ eue » fi " md to have lhe hl S hest relative P™*** 

maximum and peak bandwidth of 100%). Alternative ratios ^ en processing branches to step 470. 

and current bandwidth metrics will be apparent to those of 15 At ste P 46 °. the & is resolved by performing round robin 

ordinary skill in the art. or LRU scheduling. That is, until the QoS categories are 

After the current bandwidth has been evaluated for a QoS evaluated, the QoS queues having the same priority will be 

queue, at step 340, the QoS queue bandwidth parameters can «* ate ? ih ™& in a Predetermined order or scheduled such 

be compared to the current bandwidth to determine to which that the 9 oS ^ ueu f not P r0V1 ? ed a P acket for 

QoS category the QoS queue belongs. As described above, 20 transmission recently will be given such an opportunity, 

if (CURR__BW<PEAK_BW) and (CURR_BW<MIN_ ^ a Q° s ^ eue 10 thls manner > processing 

BW), then the QoS queue is associated with Category A at continues with step 470. 

step 350. If (CURR_BAV^MIN_BW) and ((CURR__ At step 470, a packet is dequeued from the selected QoS 

BW<MAX_BW) and (CURR_BW<PEAK_BW)), then queue and the packet is transmitted by the port at step 480. 

the QoS queue is associated with Category B at step 360. If 25 This scheduling process may be repeated by looping back to 

(CURR_BW^PEAKJBW) or (CURR_BW£MAX_ step 410, as illustrated. 

BW), then the QoS queue is associated with Category C at ^ . „ t 

step 370. Queuing Schemes 

At step 380, if all of the QoS queues have been evaluated, A variety of different queuing mechanisms may be imple- 

then processing branches to step 310; otherwise, processing 30 mented using various combinations of the QoS profile 

continues with step 330. attributes discussed above. Table 2 below illustrates how to 

achieve exemplary queuing mechanisms and corresponding 

Scheduling Processing configurations of the QoS profile attributes. 

Briefly, at each port, three levels of arbitration may be 35 

employed to select the appropriate QoS queue from which to TABLE 2 

transmit the next packet. The first level of arbitration selects Queuing Mechanism Configurations 

among the QoS categories. Category A is given priority if 

any member QoS queues have one or more pending packets. Queuing Mechanism QoS Profile Attribute Value 

Otherwise, a QoS queue with one or more pending packets ^ stlict Priority Qucuing Mkimum Bandwidth - 0% 

of Category B is selected. According to one embodiment, the Maximum Bandwidth - 100% 

relative priority assigned to each QoS queue may be used as Peak Bandwidth - 100% 

a second level of arbitriation. In this manner, when multiple Maximum Delay « n/a 

QoS qLueLies satisfy the first level arbitration, a higher B . „ . . _ , * elative ¥ i otit J T ™ < ^SF Yl 

. .7 rt „ f „ , 4 . . « „ Round Robm/Least Minimum Bandwidth =» 0% 

priority QoS queue IS favored over a lower pnonty QoS 45 Recently Used Queuing Maximum Bandwidth = 100% 

queue. Finally, when there is a tie at the second level of Peak Bandwidth - 100% 

arbitration (e.g., two or more QoS queues in the same QoS Maximum Delay «= n/a 

category have the same relative priority), a round robin or . 5 elative *f*J ~ A T™<f£ *" queues> 

, b J , , ^ _^ _ _^ , r t , - Weighted Fair Queuing Minimum Bandwidth - >0% 

least recently used (LRU) scheme may be employed to Maximum Bandwidth = max_bw£ 

select from among the two or more QoS queues until the 50 Peak Bandwidth - PEAK^BWj 

QoS categories are evaluated. Maximum Delay - n/a 

Assuming a periodic evaluation of QoS categories is 

being performed, the scheduling processing need not include 

such evaluation and the scheduling processing may be PRIORITY,- represents a programmable priority value for 

performed as illustrated by FIG. 4, according to one embodi- 55 a particular QoS queue, i. Similarly, MAX__BW ( . and 

ment of the present invention. In the embodiment depicted, PEAK„BW i represent programmable maximum band- 

at step 410, processing loops until the port associated with widths and peak bandwidths, respectively, for a particular 

the group of QoS queues being evaluated indicates it is ready QoS queue, i. 

to receive the next packet for transmission. For example, the For a strict priority scheme, each QoS queue's minimum 

port may be polled to determine its transmission status. 60 bandwidth is set to zero percent, each QoS queue's maxi- 

Alternatively, the scheduling process may be triggered by an mu m bandwidth is set to one hundred percent, and each QoS 

interrupt. In any event, when the port is ready for the next queue's peak is set to one hundred percent. In this manner, 

packet, processing continues with step 420. the current bandwidth will never be less than the minimum 

At step 420, a QoS category is selected from which a QoS bandwidth, and the current bandwidth will never exceed 

queue will provide the next packet for transmission. As 65 either the peak bandwidth or the maximum bandwidth. In 

described above, priority is given to the category containing this configuration, all QoS queues will be associated with 

QoS queues with pending data that are below the peak Category B since no QoS queues will satisfy the criteria of 



Relative Priority « <same for all queues> 
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either Category A or Category B. Ultimately, by configuring 
the QoS profile attributes in this manner, the second level of 
arbitration (e.g., the relative priority of' the QoS queues) 
determines which QoS queue is to source the next packet. 

For a pure round robin or least recently used (LRU) 5 
scheme, the QoS profile attributes are as above, but addi- 
tionally all QoS queue priorities are set to the same value. In 
this manner, the third level of arbitration determines which 
QoS queue is to source the next packet. 

Finally, weighted fair queuing can be achieved by 10 
assigning, at least, a value greater than zero percent to the 
desired minimum bandwidth. By assigning a value greater 
than zero to the minimum bandwidth parameter, the particu- 
lar QoS queue is assured to get at least that amount of 
bandwidth on average because the QoS queue will be 15 
associated with Category A until at least its minimum 
bandwidth is satisfied. Additionally, different combinations 
of values may be assigned to the peak and maximum 
bandwidths to prevent a particular QoS queue from monopo- 
lizing the link. 20 

Alternative Embodiments 

While evaluation of QoS categories has been described 
above as occurring periodically, this evaluation may also be ^ 
triggered by the occurrence of a predetermined event. 
Alternatively, evaluation of QoS categories may take place 
as part of the scheduling processing rather than as part of a 
separate periodic background process. 

While a relationship between the number of priority levels 30 
and the number of QoS queues has been suggested above, it 
is appreciated that the number of QoS queues may be 
determined independently of the number of priority levels. 
Further, it is appreciated that the number of QoS queues 
provided at each port may be fixed for every port or 3S 
alternatively a variable number of QoS queues may be 
provided for each port. 

Finally, in alternative embodiments, weighting factors 
and ratios other than those suggested herein may be used to 
adjust the current bandwidth calculation for a particular 40 
implementation. 

In the foregoing specification, the invention has been 
described with reference to specific embodiments thereof. It 
will, however, be evident that various modifications and 
changes may be made thereto without departing from the 45 
broader spirit and scope of the invention. The specification 
and drawings are, accordingly, to be regarded in an illus- 
trative rather than a restrictive sense. 

What is claimed is: 

1. A method for bandwidth management in a packet 50 
forwarding device, comprising: 

identifying a quality of service (QoS) metric correspond- 
ing to a traffic group, the QoS metric defining a 
minimum QoS for the traffic group; 

receiving a data packet associated with the traffic group; 55 

placing the data packet into one of a plurality of queues; 

identifying a current measure of network performance 
with respect to parameters specified in the QoS metric; 

and 60 
removing the data packet from the queue if a difference 
between the current measure and the minimum QoS 
falls within a threshold. 

2. The method of claim 1 wherein identifying the QoS 
metric corresponding to a traffic group further comprises: 55 

identifying the traffic group through an Internet Protocol 
(IP) subnet membership identifier; and 
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determining a corresponding QoS metric defining a mini- 
mum QoS for the traffic group. 

3. The method of claim 1 wherein identifying the QoS 
metric corresponding to a traffic group further comprises: 

identifying the traffic group through a media access 
control (MAC) address; and 

determining a corresponding QoS metric defining a mini- 
mum QoS for the traffic group. 

4. The method of claim 1 wherein identifying the QoS 
metric corresponding to a traffic group further comprises: 

identifying the traffic group through a virtual local area 
network (VLAN) identifier; and 

determining a corresponding QoS metric defining a mini- 
mum QoS for the traffic group, 

5. The method of claim 1 wherein identifying the QoS 
metric comprises receiving information indicating a mini- 
mum bandwidth for the traffic group. 

6. The method of claim 1 wherein identifying the QoS 
metric comprises receiving information indicating a maxi- 
mum sustained bandwidth for the traffic group. 

7. The method of claim 6 wherein identifying the QoS 
metric comprises receiving information indicating a peak 
bandwidth representing a bandwidth in excess of the maxi- 
mum sustained bandwidth that the traffic group can utilize, 

8. The method of claim 1 wherein identifying the QoS 
metric comprises receiving information indicating a maxi- 
mum allowable delay for the traffic group. 

9. The method of claim 1 wherein identifying the QoS 
metric comprises receiving information indicating a relative 
priority associated with the traffic group. 

10. The method of claim 1 wherein determining a current 
measure of network performance occurs at specified inter- 
vals of time. 

11. The method of claim 1 wherein determining a current 
measure of network performance with respect to parameters 
specified in the QoS metric comprises calculating the current 
measure for the parameters specified in the QoS metric. 

12. The method of claim 1 wherein receiving the data 
packet comprises receiving the data packet on a first port of 
a plurality of ports, and wherein removing the data packet 
from the queue comprises transmitting the data packet from 
a second port of the plurality of ports. 

13. The method of claim 1 wherein the packet forwarding 
device employs a non-deterministic access protocol. 

14. The method of claim 13 wherein the non- 
determninistic access protocol employed by the packet for- 
warding device is the Carrier Sense Multiple Access with 
Collision Detection (CSMA/CD) protocol 

15. An article of manufacture comprising a machine 
accessible medium having content that when accessed pro- 
vides instructions to cause an electronic system to: 

identify a quality of service (QoS) metric corresponding 

to a traffic group, the QoS metric defining a minimum 

QoS for the traffic group; 
receive a data packet associated with the traffic group; 
place the data packet into one of a plurality of queues; 
identify a current measure of network performance with 

respect to parameters specified in the QoS metric; and 
remove the data packet from the queue if a difference 

between the current measure and the minimum QoS 

falls within a threshold. 

16. The article of manufacture of claim 15 wherein the 
content to provide instructions to cause the electronic system 
to identify the QoS metric corresponding to a traffic group 
further comprises the content to provide instructions to 
cause the electronic system to: 
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identify the traffic group through an Internet Protocol (IP) to identify the QoS metric comprises the content to provide 

subnet membership identifier; and instructions to cause the electronic system to receive infor- 

determine a corresponding QoS metric defining a mini- mation indicating a maximum allowable delay for the traffic 

mum QoS for the traffic group. group. 

17. The article of manufacture of claim 15 wherein the 5 23. The article of manufacture of claim 15 wherein the 
content to provide instructions to cause the electronic system content to provide instructions to cause the electronic system 
to identify the QoS metric corresponding to a traffic group to identify the QoS metric comprises the content to provide 
further comprises the content to provide instructions to instructions to cause the electronic system to receive infor- 
cause the electronic system to: mation indicating a relative priority associated with the 

identify the traffic group through a media access control 10 traffic group. 

(MAC) address; and 24. The article of manufacture of claim 15 wherein the 

determine a corresponding QoS metric defining a mini- content to provide instructions to cause the electronic system „ 

mum QoS for the traffic group. to determine a current measure of network performance 

18. The article of manufacture of claim 15 wherein the 5 comprises the content to provide instructions to cause the 
content to provide instructions to cause the electronic system electronic system to determine the current measure at speci- 
to identify the QoS metric corresponding to a traffic group fi e d intervals of time. 

further comprises the content to provide instructions to 25. The article of manufacture of claim 15 wherein the 

cause the electronic system to: content to provide instructions to cause the electronic system 

identify the traffic group through a virtual local area 20 to determine a current measure of network performance with 

network (VLAN) identifier; and respect to parameters specified in the QoS metric comprises 

determine a corresponding QoS metric defining a mini- the content to provide instructions to cause the electronic 

mum QoS for the traffic group. system to calculate the current measure for the parameters 

19. The article of manufacture of claim 15 wherein the specified in the QoS metric. 

content to provide instructions to cause the electronic system 25 26. The article of manufacture of claim 15 wherein the 

to identify the QoS metric comprises the content to provide content to provide instructions to cause the electronic system 

instructions to cause the electronic system to receive infor- t0 rece ive the QoS metric comprises the content to provide 

mation indicating a minimum bandwidth for the traffic instructions to cause the electronic system to receive the data 

gro Ji p * . , , r r packet on a first port of a plurality of ports, and wherein the 

20. The article of manufacture of claim 15 wherein the 30 to provide i^tructions to cause the electronic system 
content to provide instructions to cause the electronic system tQ remove ^ d ^ ket from ^ the 
to identify the QoS metric composes the content to provide tQ £ instnlctions to cause the electronic tem 

instructions to cause the electronic system to receive infor- 4 A .5. , 4 , _ , ^ , 

. . . r - , , c to transmit the data packet from a second port of the plurality 

mation indicating a maximum sustained bandwidth for the Y v v J 

traffic group. 35 of P orts ' 

21. The article of manufacture of claim 20 wherein the 27 ' ™ e article of manufacture of claim 15 wherein the 
content to provide instructions to cause the electronic system P acket forwarding device employs a non-deterministic 
to identify the QoS metric comprises the content to provide access protocol, 

instructions to cause the electronic system to receive infor- 28 - article of manufacture of claim 27 wherein the 

mation indicating a peak bandwidth representing a band- 40 non-deterministic access protocol employed by the packet 

width in excess of the maximum sustained bandwidth that forwarding device is the Carrier Sense Multiple Access with 

the traffic group can utilize. Collision Detection (CSMA/CD) protocol. 

22. The article of manufacture of claim 15 wherein the 

content to provide instructions to cause the electronic system ***** 
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